Search CORE

42 research outputs found

Protein structure search and local structure characterization

Author: A Andreeva
AC Camproux
AG de Brevern
AG de Brevern
AG de Brevern
AR Ortiz
B Offmann
B Rost
C Benros
C Bystroff
CA Orengo
D Baker
E Appella
F Birzele
F Guyon
G Pollastri
HM Berman
IN Shindyalo
J Garnier
J Schuchhardt
J Vesanto
JA Hartigan
JM Yang
JS Fetrow
L Holm
M Carpentier
M Dudev
M Tyagi
M Tyagi
M Tyagi
NJ Mulder
O Sander
R Unger
S Henikoff
Shih-Yen Ku
T Madej
TL Bailey
TM Mitchell
TN Petersen
U Hobohm
VS Gowri
W Humphrey
WM Zheng
WR Pearson
Y Liu
Y Ye
Yuh-Jyh Hu
Publication venue: BioMed Central
Publication date: 01/01/2008
Field of study

Abstract Background Structural similarities among proteins can provide valuable insight into their functional mechanisms and relationships. As the number of available three-dimensional (3D) protein structures increases, a greater variety of studies can be conducted with increasing efficiency, among which is the design of protein structural alphabets. Structural alphabets allow us to characterize local structures of proteins and describe the global folding structure of a protein using a one-dimensional (1D) sequence. Thus, 1D sequences can be used to identify structural similarities among proteins using standard sequence alignment tools such as BLAST or FASTA. Results We used self-organizing maps in combination with a minimum spanning tree algorithm to determine the optimum size of a structural alphabet and applied the k-means algorithm to group protein fragnts into clusters. The centroids of these clusters defined the structural alphabet. We also developed a flexible matrix training system to build a substitution matrix (TRISUM-169) for our alphabet. Based on FASTA and using TRISUM-169 as the substitution matrix, we developed the SA-FAST alignment tool. We compared the performance of SA-FAST with that of various search tools in database-scale search tasks and found that SA-FAST was highly competitive in all tests conducted. Further, we evaluated the performance of our structural alphabet in recognizing specific structural domains of EGF and EGF-like proteins. Our method successfully recovered more EGF sub-domains using our structural alphabet than when using other structural alphabets. SA-FAST can be found at <url>http://140.113.166.178/safast/</url>. Conclusion The goal of this project was two-fold. First, we wanted to introduce a modular design pipeline to those who have been working with structural alphabets. Secondly, we wanted to open the door to researchers who have done substantial work in biological sequences but have yet to enter the field of protein structure research. Our experiments showed that by transforming the structural representations from 3D to 1D, several 1D-based tools can be applied to structural analysis, including similarity searches and structural motif finding.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Assignment of PolyProline II Conformation and Analysis of Sequence – Structure Relationship

Author: A Bornot
A Kentsis
A Rath
AA Adzhubei
AA Adzhubei
AG de Brevern
AG de Brevern
AG de Brevern
AG de Brevern
AG de Brevern
AG de Brevern
Agnel Praveen Joseph
AK Jha
Alexandre G. de Brevern
AP Joseph
AP Joseph
AW Chan
B Hess
B Offmann
B Zagrovic
BJ Stapley
BK Kay
BW Chellgren
BW Chellgren
C Etchebest
CM Venkatachalam
CY Wu
D Eisenberg
D Frishman
D van der Spoel
DA Beck
E Lindahl
E Polverini
EJ Thompson
EW Blanch
F Avbelj
F Eker
FC Bernstein
FC Peterson
FM Richards
G Darnell
G Faure
G Faure
G Labesse
G Wang
G Wang
GB Banks
GD Rose
HJC Berendsen
HM Berman
J Esque
J Makowska
J Martin
J Martin
J Martin
JC Horng
JC Kendrew
Jean-Christophe Gelly
JM Hicks
JS Richardson
JS Richardson
K Chen
L Fourrier
L Pauling
L Pauling
L Pauling
L Pauling
LL Perskie
LL Porter
LR Rabiner
M Bansal
M Dudev
M Kuemin
M Mezei
M Tyagi
M Tyagi
M Tyagi
M Tyagi
M Tyagi
MA Kelly
Markus Buehler
MB Swindells
ML Tiffany
MV Cubellis
MV Cubellis
N Colloc'h
N Sreerama
NC Fitzkee
PK Vlasov
PL Obuchowski
PM Cowan
R Berisio
R Srinivasan
RV Pappu
S Arnott
S Jun
S Kutter
SA Hollingsworth
SJ Whittington
SM King
T Kameda
T Kohonen
TP Creamer
TP Creamer
V Sasisekharan
W Kabsch
WL Jorgensen
Y Watanabe
Yohann Mansiaux
Z Liu
Z Shi
Z Shi
Z Shi
Z Shi
Z Shi
Publication venue: Public Library of Science
Publication date: 01/01/2011
Field of study

International audienceBACKGROUND: Secondary structures are elements of great importance in structural biology, biochemistry and bioinformatics. They are broadly composed of two repetitive structures namely α-helices and β-sheets, apart from turns, and the rest is associated to coil. These repetitive secondary structures have specific and conserved biophysical and geometric properties. PolyProline II (PPII) helix is yet another interesting repetitive structure which is less frequent and not usually associated with stabilizing interactions. Recent studies have shown that PPII frequency is higher than expected, and they could have an important role in protein - protein interactions. METHODOLOGY/PRINCIPAL FINDINGS: A major factor that limits the study of PPII is that its assignment cannot be carried out with the most commonly used secondary structure assignment methods (SSAMs). The purpose of this work is to propose a PPII assignment methodology that can be defined in the frame of DSSP secondary structure assignment. Considering the ambiguity in PPII assignments by different methods, a consensus assignment strategy was utilized. To define the most consensual rule of PPII assignment, three SSAMs that can assign PPII, were compared and analyzed. The assignment rule was defined to have a maximum coverage of all assignments made by these SSAMs. Not many constraints were added to the assignment and only PPII helices of at least 2 residues length are defined. CONCLUSIONS/SIGNIFICANCE: The simple rules designed in this study for characterizing PPII conformation, lead to the assignment of 5% of all amino as PPII. Sequence - structure relationships associated with PPII, defined by the different SSAMs, underline few striking differences. A specific study of amino acid preferences in their N and C-cap regions was carried out as their solvent accessibility and contact patterns. Thus the assignment of PPII can be coupled with DSSP and thus opens a simple way for further analysis in this field

Public Library of Science (PLOS)

Crossref

HAL-Inserm

Directory of Open Access Journals

PubMed Central

HAL Descartes

Hal-Diderot

ANGLOR: A Composite Machine-Learning Algorithm for Protein Backbone Torsion Angle Prediction

Author: AG de Brevern
AG de Brevern
C Branden
C Bystroff
C Mooney
C Zhang
CJC Burges
David Jones
DT Jones
H Chen
MH Zaman
MJ Wood
MV Berjanskii
NC Fitzkee
O Dor
O Zimmermann
R Karchin
R Kuang
S Haykin
S Neal
S Wu
S Wu
S Wu
SF Altschul
Sitao Wu
U Hobohm
V Vapnik
W Kabsch
Y Zhang
Y Zhang
Y Zhang
Yang Zhang
YM Huang
Publication venue: Public Library of Science
Publication date: 15/10/2008
Field of study

We developed a composite machine-learning based algorithm, called ANGLOR, to predict real-value protein backbone torsion angles from amino acid sequences. The input features of ANGLOR include sequence profiles, predicted secondary structure and solvent accessibility. In a large-scale benchmarking test, the mean absolute error (MAE) of the phi/psi prediction is 28°/46°, which is ∼10% lower than that generated by software in literature. The prediction is statistically different from a random predictor (or a purely secondary-structure-based predictor) with p-value <1.0×10−300 (or <1.0×10−148) by Wilcoxon signed rank test. For some residues (ILE, LEU, PRO and VAL) and especially the residues in helix and buried regions, the MAE of phi angles is much smaller (10–20°) than that in other environments. Thus, although the average accuracy of the ANGLOR prediction is still low, the portion of the accurately predicted dihedral angles may be useful in assisting protein fold recognition and ab initio 3D structure modeling

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

KU ScholarWorks

PubMed Central

svmPRAT: SVM-based Protein Residue Annotation Toolkit

Author: A Kernytsky
AG de Brevern
AG Murzin
AK Dunker
AR Kinjo
B Rost
C Etchebest
C Kauffman
Christopher Kauffman
DT Jones
DT Jones
G Karypis
G Pollastri
G Pollastri
GE Crooks
George Karypis
H Rangwala
Huzefa Rangwala
J Cheng
J Cheng
M Gribskov
O Noivirit-Brik
R Ahmed
R Karchin
R Sanchez
RC Whaley
S Ahmad
S Hirose
SF Altschul
T Joachims
T Schwede
V Vapnik
VN Vapnik
W Kabsch
Y Ofran
Z Dosztnyi
Publication venue: BioMed Central
Publication date: 01/01/2009
Field of study

Abstract Background Over the last decade several prediction methods have been developed for determining the structural and functional properties of individual protein residues using sequence and sequence-derived information. Most of these methods are based on support vector machines as they provide accurate and generalizable prediction models. Results We present a general purpose protein residue annotation toolkit (<it>svm</it><monospace>PRAT</monospace>) to allow biologists to formulate residue-wise prediction problems. <it>svm</it><monospace>PRAT</monospace> formulates the annotation problem as a classification or regression problem using support vector machines. One of the key features of <it>svm</it><monospace>PRAT</monospace> is its ease of use in incorporating any user-provided information in the form of feature matrices. For every residue <it>svm</it><monospace>PRAT</monospace> captures local information around the reside to create fixed length feature vectors. <it>svm</it><monospace>PRAT</monospace> implements accurate and fast kernel functions, and also introduces a flexible window-based encoding scheme that accurately captures signals and pattern for training effective predictive models. Conclusions In this work we evaluate <it>svm</it><monospace>PRAT</monospace> on several classification and regression problems including disorder prediction, residue-wise contact order estimation, DNA-binding site prediction, and local structure alphabet prediction. <it>svm</it><monospace>PRAT</monospace> has also been used for the development of state-of-the-art transmembrane helix prediction method called TOPTMH, and secondary structure prediction method called YASSPP. This toolkit developed provides practitioners an efficient and easy-to-use tool for a wide variety of annotation problems. <it>Availability</it>: <url>http://www.cs.gmu.edu/~mlbio/svmprat</url></p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Discovering structural motifs using a structural alphabet: Application to magnesium-binding sites

Author: AG de Brevern
AG de Brevern
C Bystroff
C Chotia
Carmay Lim
CH Schein
CJ Sigrist
DM Kristensen
FH Allen
H Iding
HM Berman
I Jonassen
IK McDonald
JA Cowan
JA Cowan
JD Watson
L Fourrier
M Petkovich
M Tyagi
M Tyagi
Minko Dudev
MM Harding
P Nordlund
R Kolodny
R Unger
RA Laskowski
SD Lahiri
T Dudev
T Dudev
T Dudev
VS Mathura
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Missing value imputation improves clustering and interpretation of gene expression microarray data

Author: AG de Brevern
D Wang
G Feten
H Kim
H Kuhn
H Yoshimoto
I Scheel
J Handl
J He
J Hu
J Tuikkala
JJ Wyrick
JL DeRisi
Johannes Tuikkala
Laura L Elo
M Al-Daoud
M Hirao
M Kankainen
M Ronen
M Shapira
MJ Brauer
O Troyanskaya
Olli S Nevalainen
P D'haeseleer
PT Spellman
R Jörnsten
S Oba
S Tavazoie
T Lange
Tero Aittokallio
TR Golub
X Gan
X Wang
Y Shi
Z Cai
Publication venue: BioMed Central
Publication date: 01/04/2008
Field of study

Abstract Background Missing values frequently pose problems in gene expression microarray experiments as they can hinder downstream analysis of the datasets. While several missing value imputation approaches are available to the microarray users and new ones are constantly being developed, there is no general consensus on how to choose between the different methods since their performance seems to vary drastically depending on the dataset being used. Results We show that this discrepancy can mostly be attributed to the way in which imputation methods have traditionally been developed and evaluated. By comparing a number of advanced imputation methods on recent microarray datasets, we show that even when there are marked differences in the measurement-level imputation accuracies across the datasets, these differences become negligible when the methods are evaluated in terms of how well they can reproduce the original gene clusters or their biological interpretations. Regardless of the evaluation approach, however, imputation always gave better results than ignoring missing data points or replacing them with zeros or average values, emphasizing the continued importance of using more advanced imputation methods. Conclusion The results demonstrate that, while missing values are still severely complicating microarray data analysis, their impact on the discovery of biologically meaningful gene groups can – up to a certain degree – be reduced by using readily available and relatively fast imputation methods, such as the Bayesian Principal Components Algorithm (BPCA).</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Prediction of backbone dihedral angles and protein secondary structure using support vector machines

Author: AG de Brevern
AG Murzin
AK Jain
AP Dempster
B Oliva
B Rost
B Rost
B Rost
B Xue
BH Park
BW Matthews
C Bystroff
C Bystroff
C Mooney
CB Anfinsen
CC Chang
CW Hsu
D Frishman
D Przybylski
DT Jones
DT Jones
E Faraggi
FM Richards
G Karypis
G Pollastri
GN Ramachandran
H Kim
IH Witten
J Guo
J Kyte
J MacQueen
JA Cuff
JA Cuff
JJ Ward
Jonathan D Hirst
JR Green
K Karplus
K Lin
KY Yeung
M Ouali
MJ Rooman
MJ Wood
N Cristianini
N Qian
O Dor
O Zimmermann
O Zimmermann
Petros Kountouris
PY Chou
Q Dong
R Karchin
R Kuang
S Henikoff
S Hua
S Qin
S Wu
SC Lovell
SF Altschul
SK Riis
U Hobohm
V Vapnik
W Kabsch
XM Pan
Y Xu
YM Huang
Publication venue: BioMed Central
Publication date: 01/01/2009
Field of study

Abstract Background The prediction of the secondary structure of a protein is a critical step in the prediction of its tertiary structure and, potentially, its function. Moreover, the backbone dihedral angles, highly correlated with secondary structures, provide crucial information about the local three-dimensional structure. Results We predict independently both the secondary structure and the backbone dihedral angles and combine the results in a loop to enhance each prediction reciprocally. Support vector machines, a state-of-the-art supervised classification technique, achieve secondary structure predictive accuracy of 80% on a non-redundant set of 513 proteins, significantly higher than other methods on the same dataset. The dihedral angle space is divided into a number of regions using two unsupervised clustering techniques in order to predict the region in which a new residue belongs. The performance of our method is comparable to, and in some cases more accurate than, other multi-class dihedral prediction methods. Conclusions We have created an accurate predictor of backbone dihedral angles and secondary structure. Our method, called DISSPred, is available online at <url>http://comp.chem.nottingham.ac.uk/disspred/</url>.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

ROTAS: a rotamer-dependent, atomic statistical potential for assessment and prediction of protein structures

Author: A Ben-Naim
A Fernández
A Figureau
A Kolinski
A Panjkovich
AG De Brevern
AG De Brevern
AG Turjanski
AJ Bordner
AM Ruvinsky
B John
B Park
B Qian
C Keasar
C Zhang
CE Metz
CM Deane
D Rykunov
DT Jones
F Melo
F Zhao
FE Boas
G Lamoureux
G Wang
H Lu
H Schrauber
H Zhou
H Zhou
HM Berman
I Jonassen
I Jonassen
J Janin
J Skolnick
J Xu
J Zhang
JM Word
Jungkap Park
JW Ponder
Kazuhiro Saitou
KE Johansson
KT Simons
L Wroblewska
M Lu
M-Y Shen
MJ Sippl
MJ Sippl
MV Shapovalov
N-V Buchete
NS Bogatyreva
P Benkert
P Cossio
PD Thomas
PJ Munson
R Samudrala
R Samudrala
RA Friesner
RK Singh
RL Dunbrack
S Karlin
S Mayewski
S Miyazawa
S Miyazawa
S Miyazawa
S Tanaka
SJ Wodak
T Bereau
T Hamelryck
T Kortemme
TA Halgren
Y Su
Y Wu
Y Xia
Y Yang
Y Zhang
Z-Y Zhu
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

TANGLE: Two-Level Support Vector Regression Approach for Protein Backbone Torsion Angle Prediction from Primary Sequences

Author: A Schlessinger
A Schlessinger
A Schlessinger
AG de Brevern
B Rost
B Rost
B Rost
B Xue
C Bystroff
C Haynes
C Mooney
C Zhang
C Zheng
Christian Schönbach
D Xie
DT Jones
E Faraggi
E Faraggi
G Helles
Geoffrey I. Webb
GN Ramachandran
GP Raghava
H Zhang
H Zhang
Hao Tan
HJ Dyson
HS Kang
J Cheng
J Gao
J Gsponer
J Song
J Song
J Song
J Song
J Song
J Song
Jiangning Song
JJ Ward
JS Chauhan
K Chen
K Chen
K Chen
L Chen
L Kurgan
M Kumar
Mingjun Wang
MJ Mizianty
MJ Rooman
MJ Wood
MJ Wood
MK Kalita
MN Nguyen
MN Nguyen
MV Berjanskii
O Dor
O Dor
O Zimmermann
P Chen
P Kountouris
P Kountouris
P Sliz
PC Chen
R Gaudet
R Karchin
R Kuang
R Verma
S Ahmad
S Ahmad
S Liang
S Qiu
S Wu
S Wu
SF Altschul
T Ishida
T Zhang
T Zhang
Tatsuya Akutsu
V Vapnik
V Vapnik
W Kabsch
W Liu
W Zhang
X Miao
X Wang
XY Pan
Y Ofran
Y Ofran
YM Huang
Z Markovic-Housley
Z Yuan
Z Yuan
Z Yuan
Z Yuan
Publication venue: Public Library of Science
Publication date: 01/01/2012
Field of study

Protein backbone torsion angles (Phi) and (Psi) involve two rotation angles rotating around the Cα-N bond (Phi) and the Cα-C bond (Psi). Due to the planarity of the linked rigid peptide bonds, these two angles can essentially determine the backbone geometry of proteins. Accordingly, the accurate prediction of protein backbone torsion angle from sequence information can assist the prediction of protein structures. In this study, we develop a new approach called TANGLE (Torsion ANGLE predictor) to predict the protein backbone torsion angles from amino acid sequences. TANGLE uses a two-level support vector regression approach to perform real-value torsion angle prediction using a variety of features derived from amino acid sequences, including the evolutionary profiles in the form of position-specific scoring matrices, predicted secondary structure, solvent accessibility and natively disordered region as well as other global sequence features. When evaluated based on a large benchmark dataset of 1,526 non-homologous proteins, the mean absolute errors (MAEs) of the Phi and Psi angle prediction are 27.8° and 44.6°, respectively, which are 1% and 3% respectively lower than that using one of the state-of-the-art prediction tools ANGLOR. Moreover, the prediction of TANGLE is significantly better than a random predictor that was built on the amino acid-specific basis, with the p-value<1.46e-147 and 7.97e-150, respectively by the Wilcoxon signed rank test. As a complementary approach to the current torsion angle prediction algorithms, TANGLE should prove useful in predicting protein structural properties and assisting protein fold recognition by applying the predicted torsion angles as useful restraints. TANGLE is freely accessible at http://sunflower.kuicr.kyoto-u.ac.jp/~sjn/TANGLE/

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Monash University Research Portal

The primary headaches: genetics, epigenetics and a behavioural genetic model

Author: A Ambrosini
A Buchwalder
A Carlsson
A Ducros
A Joutel
A Jouvenceau
A MaassenVanDenBrink
A MaassenVanDenBrink
A May
A Oterino
A Petronis
A Vicentic
AD Ogilvie
AG Shepherd
AM Maagdenberg van den
AM Oommen
B Borroni
B Echenne
BL Hart
C Netzer
C Roberge
C Sjöstrand
C Sjöstrand
C Sjöstrand
C Tzourio
CA Bordini
CD Salpietro
D D’Amico
D D’Amico
D Soragna
D Svensson
D Vercelli
DR Nyholt
DR Nyholt
DR Nyholt
DR Nyholt
E Loder
EE Kors
EG Couturier
EJ Hong
EL Spierings
F Fernandez
F Gurkan
F Pierelli
F Riant
FA Champagne
G Burlet
G Juhasz
G Terwindt
GM Terwindt
GM Terwindt
H Chabriat
H Kowa
HC Siow
Headache Classification Subcommittee of the International Headache Society
HT Bjornsson
I Alonso
I Hovatta
I Kara
I Rainero
I Rainero
I Rainero
I Rainero
IC Weaver
IC Weaver
II Gottesman
J Corral
J Cuypers
J Haan
J Haan
J Haan
J Ojaimi
JA Iniesta
JC Jen
JC Michael
JJ Plomp
JM Levenson
JN Blau
JN Blau
JS Kim
K Beauvais
K Gardner
K Jurkat-Rott
K Majamaa
KJ Swoboda
KM Welch
KR Merikangas
KR Vanmolkot
KR Vanmolkot
KW Jones
L Baumber
L Monari
L Monari
L Russo
L Savi
LC McCarthy
LM Cupini
LR Griffiths
M Brevern von
M Devoto
M Dichgans
M Dichgans
M Dichgans
M El Amrani
M Fusco De
M Giacovazzo
M Kirchmann
M Marziniak
M Mochi
M Mochi
M Mochi
M Odawara
M Schürks
M Spadaro
M Spranger
M Wessman
M Yilmaz
M Zompo Del
MB Russell
MB Russell
MB Russell
MB Russell
MB Russell
MB Russell
MB Russell
ME Erdal
MF Fraga
MJ Mortimer
MJ Mortimer
MP Johnson
MT Bassi
N Rebaudengo
NJ Colson
NJ Giffin
O Sjaastad
O Zhuchenko
P Aridon
P Cortelli
P Lulli
P Martelletti
P Montagna
P Montagna
P Montagna
P Rossi
P Seibel
Pasquale Montagna
R Alberca
R Brugnoni
R Curtain
R Curtain
R Curtain
R Mössner
R Simone De
RA Lea
RA Lea
RA Lea
RA Maselli
RA Ophoff
RA Ophoff
RG Boles
RG Boles
RL Jirtle
RP Curtain
S Auvin
S Cevoli
S Cevoli
S Kaja
S Maude
S Noble-Topham
S Paterna
S Schuh-Hofer
S Soriani
S Ventegodt
SH Subramony
SJ Peroutka
T Shimomura
T Wieser
T Yoshihara
TC Chen
U Todt
V Marini
V Ulrich
V Ulrich
Y Goto
YJ Karten
ZM Cader
Publication venue: Springer Milan
Publication date: 01/01/2008
Field of study

The primary headaches, migraine with (MA) and without aura (MO) and cluster headache, all carry a substantial genetic liability. Familial hemiplegic migraine (FHM), an autosomal dominant mendelian disorder classified as a subtype of MA, is due to mutations in genes encoding neural channel subunits. MA/MO are considered multifactorial genetic disorders, and FHM has been proposed as a model for migraine aetiology. However, a review of the genetic studies suggests that the FHM genes are not involved in the typical migraines and that FHM should be considered as a syndromic migraine rather than a subtype of MA. Adopting the concept of syndromic migraine could be useful in understanding migraine pathogenesis. We hypothesise that epigenetic mechanisms play an important role in headache pathogenesis. A behavioural model is proposed, whereby the primary headaches are construed as behaviours, not symptoms, evolutionarily conserved for their adaptive value and engendered out of a genetic repertoire by a network of pattern generators present in the brain and signalling homeostatic imbalance. This behavioural model could be incorporated into migraine genetic research

Crossref

Springer - Publisher Connector

PubMed Central

Archivio istituzionale della ricerca - Alma Mater Studiorum Università di Bologna